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We develop some theory of spinal decompositions of discrete and 
continuous fragmentation trees. Specifically, we consider a coarse and 
a fine spinal integer partition derived from spinal tree decompositions. 
We prove that for a two-parameter Poisson-Dirichlet family of con- 
tinuous fragmentation trees, including the stable trees of Duquesne 
and Le Gall, the fine partition is obtained from the coarse one by 
shattering each of its parts independently, according to the same 
law. As a second application of spinal decompositions, we prove that 
among the continuous fragmentation trees, stable trees are the only 
ones whose distribution is invariant under uniform re-rooting. 

1. Introduction. Starting from a rooted combinatorial tree Tj^j with n 
leaves labeled by [n] = {1, . . . , n}, we call the path from the root to the leaf 
labeled 1 the spine of T[„] . Deleting each edge along the spine of T[n] defines a 
graph whose connected components we call bushes. If, as well as cutting each 
edge on the spine, we cut each edge connected to a spinal vertex, each bush 
is further decomposed into subtrees. We thus obtain two nested partitions of 
{2, . . . , n}, which naturally extend to partitions of [n] by adding the singleton 
{!}. We call these partitions of [n] the coarse spinal partition and the fine 
spinal partition derived from T[„]. See, for example. Figure 2. 

The aim of this paper is to develop some theory of spinal decompositions 
of fragmentation trees that arise as genealogical trees of fragmentation pro- 
cesses. We focus on Markovian partition- valued fragmentation processes of 
the following two types. In a setting of discrete time and partitions of [n]. 
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we postulate that each nonsingleton block splits at each time, which leads 
to Markov branching models [4, 18, 26]. In a setting of continuous time 
and partitions of N we postulate a self-similarity condition, which leads to 
self-similar continuum random trees [25, 26]. 

Before giving an overview of this paper in Section 1.3, we formally intro- 
duce the discrete setting in Section 1.1 and the continuous setting in Section 
1.2. 

1.1. Discrete fragmentations. We start by introducing a convenient for- 
malism for the kind of combinatorial trees arising naturally in the context of 
fragmentation processes. Let i? be a finite nonempty set, and write i^B for 
the number of elements of B. Following standard terminology, a partition of 
B is a collection 

n_B = {^1, • • • ,Bk} 

of nonempty disjoint subsets of B whose union is B. To introduce a new 
terminology convenient for our purpose, we make the following recursive 
definition. A fragmentation of B (sometimes called a hierarchy or a total 
partition [35, 36]) is a collection Tg of nonempty subsets of B such that: 

(i) B^Tb, 

(ii) if #B > 2 there is a partition 11^ of i? into at least two parts 
Bi, . . . , Bk, called the children of B, with 

(1) TB = {B}UTB,U---iJTB„ 

where Te. is a fragmentation of Bi for each 1 <i <k. 

Necessarily Bi € Te, each child Bi of B with #Bi > 1 has further children, 
and so on, until the set B is broken down into singletons. We use the same 
notation Tb for both: 

• such a collection of subsets of B, and 

• for the tree whose vertices are these subsets of B, and whose edges are 
defined by the parent / child relation implicitly determined by the collection 
of subsets of B. 

To emphasize the tree structure we may call Tb a fragmentation tree. Thus 
B is the first branch point of Tb, and each singleton subset of i? is a leaf of 
Tb, see Figure 1. It is often convenient to plant Tb by adding a ROOT vertex 
and an edge between the ROOT and the first branch point B. We denote by 
the collection of all fragmentation trees labeled by B. 
For each nonempty subset A oi B, the restriction to A ofTB, denoted 
Ta,b, is the fragmentation tree whose first branch point is A, whose leaves are 
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the singleton subsets of A, and whose tree structure is defined by restriction 
of Tb- That is, Ta,b is the fragmentation 

Ta,b = {CnA:CnA^0,CeTB}eTA, 

corresponding to a reduced subtree as discussed by Aldous [1]. 

Given a rooted combinatorial tree with no single-child vertices and whose 
leaves are labeled by a finite set B, there is a corresponding fragmentation 
tree Tb , where each vertex of the combinatorial tree is associated with the set 
of leaves in the subtree above that vertex. So the fragmentation trees defined 
here provide a convenient way to both label the vertices of a combinatorial 
tree, and to encode the tree structure in the labeling. 

A random fragmentation model is an assignment of a probability distri- 
bution on for a random fragmentation tree Tb with first branch point 
B for each finite subset B of N. We assume throughout this paper that the 
model is exchangeable, meaning that the distribution of 11^, the partition of 





Fig. 2. A fragmentation tree T[g], with coarse spinal partition {{!}, {24569}, {378}}, 
coarse spinal composition ({24569}, {378}, {!}) and fine spinal partition 
{{1},{2},{378},{4},{569}}, 
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B generated by the branching of Tb at its root, is of the form 

(2) ¥{Ub = {Bi,...,Bk})= p{#Bi , . . . , #Bk) 

for all partitions {Bi, . . . , B^} with k>2 blocks, and some symmetric func- 
tion p of compositions of positive integers, called a splitting probability rule. 
The model is called: 

• Markovian (or a Markov branching model) if given Hb = {Bi, . . . ,Bk}, the 
k subtrees of Tb above B are independent and distributed as Tb^ , ■ • ■ , Tb^. , 
for all partitions {Bi, . . . ,Bi^} oi B; 

• consistent if for every A C B, the restriction to A of Tb is distributed like 
Ta; 

• binary if every A E Tb has either or 2 children with probability one, for 
ah B. 

Now we take B = [n]. The collection of vertices at graph distance m > 
above the first branch point form a partition of a subset of [n] that we 

extend to a partition Ilm^ of [n] by adding a singleton {j} for each leaf j at 

height below m. We refer to {Ilm\m > 0) as the partition- valued discrete 
fragmentation process associated with T[„]. See also [4, 18, 26]. 

1.2. Continuous self- similar fragmentations. We denote by P the set of 
partitions of N and equip it with the distance d{7T,7r') = 2~"('^''^), where 
n(7r,7r') is the largest integer such that the restrictions of partitions tt, vr' 
to [n] coincide. Following Bertoin [9], a continuous-time 7^- valued Markov 
process (n(t),t > 0) is called a self- similar fragmentation process with index 
a G M if it is cddldg and: 

• n(0) = {N}, that is, 11 starts from the trivial partition with a unique 
block; 

• n is exchangeable, that is, its distribution is invariant under permutations 
of N; 

• given n(t) = TT, the post-t process {Hit + s), s > 0) has the same law as 
the process whose state at time s > is the partition of N whose blocks 
are those of 

7rinn«(|7r,rs), i>l, 

where {iri,i > 1) is the sequence of blocks of vr in order of least elements, 
(KiM > 1) is the sequence of their asymptotic frequencies and {Il^^\i > 1) 
is a sequence of i.i.d. copies of 11 . 

We recall that Kingman's theorem [27] on exchangeable partitions ensures 
that for every t>0, the asymptotic frequencies |7ri| = lim„^oo n'^f^^iTi n [n]) 
of all blocks tTj of n(t) exist a.s. Bertoin [8] shows that actually a.s. for every 
t, these asymptotic frequencies exist. 
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In [9], Bertoin proved that the distribution of a self-similar fragmentation 
is entirely characterized by three parameters: the index of self-similarity a, a 
coefficient c > that measures the rate of erosion and a dislocation measure 
on 




with no atom at (1,0,...) and that integrates 1 — si. This measure v de- 
scribes the sudden dislocations of blocks, in the sense that a block B 
splits in some blocks i?i , i?2 , • . • with relative asymptotic frequencies s £ 
at rate \B\^i'{ds). When the index a = 0, this fragmentation rate does not 
depend on the size of the blocks and the fragmentation processes is then 
said to be homogeneous. A crucial point is that a self-similar fragmentation 
with parameters a, c and can always be constructed measurably from a 
homogeneous fragmentation with same coefficient c and measure ly, using 
time-changes, and vice versa. We refer to Bertoin's book [10] and the above 
mentioned papers [8, 9] for details on these time-changes and background 
on homogeneous and self-similar fragmentations. 

In this paper, we focus on self-similar fragmentations without erosion 
(c = 0), which are nontrivial {iy{S^) 7^ 0) and do not lose mass at sudden 
dislocations, that is, 




We call (a, the characteristic pair of such a process. 

A family of combinatorial trees with edge lengths > 1, with n ex- 

changeably labeled leaves, is naturally associated to a self-similar fragmen- 
tation process 11 by considering the evolution of 11 restricted to the first n 
integers. Specifically, i2[„] consists of all blocks B that occur in the evolu- 
tion of n n [n] ; an edge between the root and the first branch point [n] has 
as its length the first dislocation time of 11 n [n] , and similarly for subtrees 
with two or more leaves; the edge below leaf j has as its length the time 
between the last relevant dislocation time of 11 n [n] and the time when {j} 
becomes a singleton for 11, which may be infinite. This gives a consistent 
family of trees, in the sense that the subtree of R[n] spanned by [k] is iij^], 
for all k <n, where superfluous (i.e., multiplicity 2) vertices are removed 
and associated edges merged, their lengths summed up. By exchangeabil- 
ity, the same is true in distribution for uniformly chosen k distinct leaves 
of R[n]j relabeled by [k]. The coupling of self-similar fragmentations using 
time-changes entails that the distribution of the combinatorial shapes (say 
T[n]) of R[n],n > 1, depends only on the dislocation measure u, and not on 
the index a. So without loss of generality, we may focus on a = 0, the case 
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of homogeneous fragmentations, when working with the shapes T[„],n > 1. 
Furthermore, (T[„],n > 1) defines a consistent Markov branching model as 
in the previous subsection. Reciprocahy, each consistent Markov branching 
model can be constructed similarly from some homogeneous fragmentation 
(possibly with erosion). See [26]. 

When the index a is negative, small fragments vanish quickly and it is 
well known that the whole fragmentation IT then reaches in finite time the 
trivial partition composed exclusively of singletons. See, for example, [10]. 
In terms of trees, this implies that the height of R^^] is bounded uniformly in 
n. Using the consistency property and Aldous' results [3], it is then possible 
to define the projective limit T of the family {R^n] ,n'>l) and equip it with a 
probability measure fi, the mass measure, that arises as limit of the empirical 
measures on the leaves of > 1. Implicitly, the tree T is rooted. The 

pair {T,fj,) is a continuum random tree (CRT) and was studied in [25] using 
Aldous' formalism of trees as compact metric subsets of h; cf. [1, 2, 3]. An 
alternative formalism can be considered, via the set of equivalence classes 
of compact rooted M-trees endowed with the Gromov-Hausdorff distance, 
as developed in [16, 17]. We will not go further into details here and refer 
to the above-mentioned papers for rigorous definitions and statements. We 
shall call the CRT (T, /i) a self-similar fragmentation CRT with parameters 

A fundamental property of {T,^) is that a version of (i?[„],n > 1) can be 
obtained from a random sampling Li, L2, . . . picked independently according 
to conditional on (T,/i), by considering for each n the subtree of T 
spanned by the root and leaves Li, . . . ,L„. Consider then the forest J^r{t) 
obtained by removing in T all vertices at distance less than t from the root 
and define the random partition 11' (t) by letting i and j be in the same 
block of n'(t) if and only if Li and Lj are in the same connected component 
of J^q-^t), t >0. Clearly the process 11' is distributed as 11. We shall often 
suppose in the following that the fragmentation process we are working with 
is constructed in such a manner from some self-similar fragmentation CRT. 

Examples of self-similar fragmentation CRTs are the Brownian CRT of 
Aldous [1, 2, 3] and, more generally, the stable Levy trees with index (3 € 
(1,2] of Duquesne and Le Gall [14, 15]. For details on their fragmentation 
properties, see Bertoin [9] for the Brownian case (i.e., when (3 = 2) and 
Miermont [29] for the other stable cases. The parameters of these CRTs are 
recalled later in the paper. 

1.3. Contents and organization of the paper. The structure and contents 
of this paper are as follows. In Section 2, we study the coarse and fine 
spinal partitions of some Markov branching model (T[„],n > 1) constructed 
consistently from a self-similar fragmentation process. These partitions of 
[n] are consistent as n varies, which leads to a nested pair of partitions 
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of N. Restricted to N \ {!}, they are jointly exchangeable. In particular, 
they possess asymptotic frequencies a.s. The decreasing rearrangements of 
these frequencies are called the coarse spinal mass partitions and fine spinal 
mass partitions. By decomposing the trees along the spine, we then show 
that when the parameters a and u of the fragmentation are known and u is 
infinite, we can reconstruct the whole self-similar fragmentation process from 
the sequence of shapes (T[„],n > 1) (Proposition 2). Next, the main result 
of this section (Theorem 6) states that under some factorization property 
of the dislocation measure v (Definition 2), the fine spinal mass partition 
derived from the sequence of shapes (r[„],n > 1) is obtained from the coarse 
one by shattering each of its fragments in an i.i.d. manner. 

In particular, this result applies to a family of fragmentations whose dislo- 
cation measures are built from Poisson-Dirichlet partitions (Section 3). The 
stable fragmentations studied by Miermont [29], built from the stable trees 
of Duquesne and Le Gall with index in (1, 2), belong to this family. As a con- 
sequence, we obtain an extensive description, in terms of Poisson-Dirichlet 
partitions (Corollary 10), of spinal decompositions of stable trees. 

The stable trees {T,n) are known to possess an interesting symmetry 
property of invariance under uniform re-rooting (see [2, 13, 15]). Informally, 
this means that taking a leaf at random according to /i and considering T 
rooted at this random leaf, gives a CRT with the same distribution as the 
original CRT with its original root. In Section 4, we give a new proof of this 
invariance, using combinatorial methods, and show that, up to a scaling 
factor, stable trees are the only self-similar fragmentation CRTs that are 
invariant under uniform re-rooting (Theorem 11). 

To finish this introduction, let us mention that studies on spinal decom- 
positions of various trees exist in the literature. See, for example, Aldous- 
Pitman [6] (for Galton- Watson trees), Duquesne-Le Gall [15] (for stable and 
Levy trees). In the fragmentation context, decomposing the trees/processes 
along the spine is a useful tool, which has been used to obtain results on 
large time asymptotics [11], small time asymptotics [24] and discrete ap- 
proximations [26]. 

2. Spinal partitions of fragmentation trees. Decompose a combinatorial 
fragmentation tree T^^] with leaves labeled by [n] along the spine from the 
root to leaf 1 into a collection of bushes by deleting each edge along the 
spine. By adding a conventional root edge to its base, each bush is identified 
with an element of for some B Q [n], where is the collection of 
rooted combinatorial trees with leaves labeled by B. Each such B is 
associated with a unique vertex on the spine of Tj^j . We list these sets of leaf 
labels B in order of the corresponding spinal vertices to obtain an ordered 
exchangeable random partition of {2,...,n}. The first set B in this list 
is the set of elements of [n] not in the block containing 1 after the first 
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fragmentation event involving [n] . If after the first fragmentation of [n] the 
block [n] — B containing 1 is of size 2 or more, the next set is what remains 
of [n] — B after deleting the block containing 1 in the next fragmentation of 
[n] — B, and so on, until the last set which is the singleton {!}. If as well 
as cutting each edge on the spine, we cut each edge connected to a spinal 
vertex, each bush is further decomposed into subtrees. We thus obtain two 
nested exchangeable random partitions of {2, . . . , n}, which naturally extend 
to partitions of [n] by adding the singleton {!}, the coarse and fine spinal 
partitions derived from T[„]. We can include the spinal order in the coarse 
spinal partition to form the coarse spinal composition. 

Assuming that the trees Tj^j,?!^ > 1, are constructed consistently from 
a homogeneous fragmentation process with values in the partitions of N, 
both partitions of [n] are consistent as n varies. Thus the coarse and fine 
spinal partitions may be regarded as a nested pair of random partitions 
of N. These partitions have natural interpretations in terms of associated 
partition- valued self-similar fragmentations processes (n(t),t >0), of any 
index a, in which the sequence {T^n],n > 1) is embedded. For each pair of 
integers i and j let the splitting time Di j be the first time t that i and 
j fall in distinct blocks of 11 (t). Let i,j > 2. By construction, i and j fall 
in the same block of the coarse spinal partition if and only if Di i = Dij, 
whereas i and j fall in the same block of the fine spinal partition if and only 
if Dij > Di^i (this clearly implies Di^i = Dij). Assuming further that 11 is 
constructed by random sampling of leaves Li, L21 • • • from some CRT (T, fx) 
according to fi, i and j fall in the same block of the coarse spinal partition 
if and only if the paths from Lj and Lj to the root first meet the spine of 
T, that is, the path from the root to Li, at the same point. Besides, i and 
j fall in the same block of the fine spinal partition if and only if the path 
from Li to Lj does not intersect the spine. 

The coarse spinal decomposition of T is the collection of equivalence 
classes for the random equivalence relation x ~ 7/ if and only if the paths 
from X and y to the root first meet the spine at the same point on the 
spine. Note that the whole spine itself carries no /i-mass, and spinal non- 
branchpoints (an uncountable set of singletons in this decomposition of T) 
will be excluded from further consideration. The restriction of T to a typi- 
cal equivalence class is a bush which can be further decomposed into trees 
by deleting the point on the spine, and then giving each connected compo- 
nent its own root where it used to be connected to the spine. The resulting 
random partition of T into subtrees is the fine spinal decomposition of T. 

We measure the size of each component of one of these partitions by its 
/Li-mass, to obtain coarse and fine spinal mass partitions of {T,fi), which 
we may regard as two random elements of The following proposition 
summarizes some basic properties of these random partitions, which follow 
easily from the above discussion. 



SPINAL PARTITIONS AND INVARIANCE UNDER RE-ROOTING 



9 



Proposition 1. The coarse and fine spinal partitions derived from the 
sequence of shapes (T[„],n > 1) embedded in {T,fj,) have the following prop- 
erties: 

(i) The singleton block {1} belongs to both partitions of N, while the 
restrictions of these partitions to N\{1} are jointly exchangeable. 

(ii) The sequence of ranked limiting frequencies of each partition of N 
is the sequence of ranked fi-masses of the corresponding mass partition of 

We now offer a more detailed study of these two partitions, first consider- 
ing the coarse spinal partition (and composition), then the fine one and its 
relation to the coarse one. Obviously, the fine spinal partition is identical to 
the coarse one if and only if the trees T[„] are binary for all n > 1 . 

2.1. The coarse spinal partition. Assume throughout this section that 
the trees T[n],n > 1, are constructed consistently from a homogeneous frag- 
mentation process, as when T[„] is derived from an associated continuum 
tree {T,n) as the shape of the subtree spanned by € [n], for Li,L2, . . . 
an exchangeable sample of leaves with directing measure /i. To ease notation 
we work with T^n+i] instead of T[„] . Let 

Bn,l,Bn,2, ■ ■ ■ ,Bn,Kn: {1} 

be the sets of leaves of the bushes derived from the coarse spinal decomposi- 
tion of T^n+i] 5 in order of the corresponding spinal vertices. Then {Bn,i, Bn,2, 
. . . , Bn^Kn) is the restriction to {2, . . . , 1} of an exchangeable ordered ran- 
dom partition of {2,3, . . .}, as studied in [12, 21]. Let 

(3) Cn := {ifBn,l,#Bn,2, • • • , #Bn,K„)- 

It follows easily from sampling consistency of the sequence (r[„],n > 1) that 
{Cn,n > 1) is a regenerative composition structure, as defined in [19]. That is 
to say, {Cn,n > 1) is a sampling consistent sequence of random compositions 
Cn of n, with the property that conditionally given the first part of C.„ is of 
size i < n, the remaining parts of Cn define a random composition of n — i 
with the same distribution as Cn-i- Let 

k 

Sn,k ■='>T' — ^^ #BnJ, 

where Bnj is empty for j > Kn- So [Sn,k + l,fc > 0) is the sequence of 
sizes of the fragment containing 1 as it undergoes successive fragmentations 
according to T^n+i] ■• starting with Snfl = n and terminating with Sn,k = for 
k > Kn, where Kn is the total number of fragmentation events experienced 
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by the block containing 1 in T^^+i]- According to Gnedin and Pitman [19], 
there is the following almost sure convergence of random sets with respect 
to the Hausdorff metric on closed subsets of [0, 1] : 

(4) {Sn,k/n,k> 0}^-^{exp(-6),t > 0}^\ 

where the left-hand side is the random discrete set of values Sn,k rescaled 
onto [0, 1] , and the right-hand side is the closure of the range of the exponen- 
tial of some subordinator {S,t,t>0). The random interval partition of [0, 1] 
defined by interval components of the complement of the closed range of 
1 — has a natural interpretation in terms of the associated CRT (T, 
the lengths of these intervals are the strictly positive masses of components 
in the coarse spinal decomposition of (T, in the order they appear along 
the spine from the root to leaf 1 . We will therefore call this interval partition 
the coarse spinal interval partition of [0, 1] derived from (T, /x). In terms of 
the associated homogeneous fragmentation, the lengths of these intervals are 
the total masses of fragments thrown off by the mass process of the fragment 
containing 1, put in the order they split away from this tagged fragment. 
Otherwise said, exp(— ^) is the mass process of the fragment containing 1. 
Since the fragmentation process has zero erosion and no sudden loss of mass, 
the subordinator ^ has no drift and no killing. Bertoin [8] proved that the 
Levy measure of ^ is then given by 

(5) A{dx) = exp(— x) u{— logg^ g dx), x>0. 

i>l 

Proposition 2. Let (n(t),t > 0) be a self- similar fragmentation pro- 
cess, with index a € M and dislocation measure v with infinite total mass. 
Then the entire process {Il(t),t > 0) can be constructed from the consistent 
sequence (T[,„], n > 1) of combinatorial shapes of trees derived from 11. 

Proof. In view of the time-change relation between fragmentations of 
different indices, it suffices to consider the homogeneous case. Given the con- 
sistent family of trees (T[„],ri > 1), we first use (4) to recover the closure of 
the range of exp(— (^), hence also the closure of the range of the subordi- 
nator describing the evolution of the mass fragment containing 1. Since the 
dislocation measure has infinite mass, so does the Levy measure of ^. Then 
it is well known that the entire sample path of can be measurably recon- 
structed from its range, up to a constant factor on the time scale (see, e.g., 
[22]). Since the distribution of ^ is determined by that of (n(i),t > 0), this 
constant is known. Let n„ = (n„(t),i > 0) be the restriction of (n(t),t > 0) 
to [n]. The path of ^, and its construction (4) from (T[„],n > 1), determine 
almost surely for each n the sequence of random times t when transitions of 
n„ occur which change the block of n„ containing 1, and at each of these 



SPINAL PARTITIONS AND IN VARIANCE UNDER RE-ROOTING 11 

times t the block of n„(t) containing 1 can be read from T[„]. By exchange- 
ability, the same reconstruction can evidently be done almost surely for the 
block of n„(t) containing j, for each 1 <j <n. But this information deter- 
mines the entire path of (n„(i),t > 0), for each n, hence that of (n(t),t > 0). 
□ 



Corollary 3. If in the setting of Proposition 2 we have a < 0, then 
an associated {a, u) -fragmentation CRT (T, /x) can also be constructed from 
(T[„],n > 1) on the same probability space. 

Proof. While the construction of a self-similar fragmentation CRT in 
[25] from a self-similar partition-valued fragmentation process is carried out 
explicitly only "in distribution," it is not hard to give an almost sure con- 
struction, for example, via Aldous' sequential construction in h (see, e.g., 
[3], page 252). This yields an increasing sequence of trees with edge lengths 
that converges in distribution, hence almost surely, with respect to the 
Hausdorff metric on closed subsets of h. The almost sure convergence of 
empirical measures on the leaves of R[n] to a mass measure fx is then given 
by [3], Lemma 7 (convergence of measures is weak convergence). □ 

We record now an explicit distributional result for the coarse spinal parti- 
tion of T^n+i] 1 which can either be read from [19] or derived directly. Recall 
that n+1 — #-Bn,i is the size of the fragment containing 1 at the first branch 
point of T^n+i] - Let S((is) := J2'jLi ^{^j ^ ds) and let A be the Levy measure 
of (Ct)* > 0), which, according to (5), is the image of sS(cis) via s i— > — logs. 
Then by embedding in the homogeneous fragmentation, we see that 

(6) P(#S„,i =m) = $(n:m)/^>(n) (l<m<n), 

where ^>(n) is the total rate of fragmentations with some effect on partitions 
of [n-|- 1], and $(n:m) the rate of such fragmentations in which 1 ends 
up in a block of size 77, -|- 1 — m. From standard results on the construction 
of the homogeneous fragmentation from its dislocation measure v (cf. [10], 
Chapter 3), these rates are easily evaluated as follows: 

^{n:m)=('^ \ /\"+^-'"(l - s)"'S((is) 



(7) 



and 







[n—m)x 







e-^"-™^^(l-e-^)'"A(d2;) 



" PCX) 

(8) $(77)= y «>(r7:7T7)= / (l-s")sS(ds)= / (l-e-"^')A(dx). 
^1 ^0 Jo 
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From this and [19], we get the exchangeable partition probabiHty function 
(EPPF) of the coarse spinal partition {Bn,i, Bn,2, ■ ■ ■ , Bn^Xn} restricted to 
{2, . . . , n + 1}, that is, the probabilities p{ni, . . . , n^) = F{{Bn,i, • • • , Bn,K„} = 
vr), for each particular partition 7rof{2,...,n + l}in sets of sizes ni, . . . , n^, 
Vn > 1, V(ni, . . . ,nfc) partition of n. For an explicit formula, see [19], espe- 
cially formulae (3), (4), (6) and (26). Various further properties of the coarse 
spinal partition can also be read from [19]. 

2.2. The fine spinal partition. We start by observing some basic symme- 
try properties of this partition. 

Proposition 4. (i) Consider the fine spinal partition derived from T^^+i]) 
restricted to {2, . . . , n + 1}. Then, conditionally given the sizes of its compo- 
nents, say ni, ■ ■ ■ ,nk with Y^^=i^i — ^he corresponding collection of sub- 
trees o/T[„_|_]^], modulo relabeling by [ni], . . . , [n^], is a collection of indepen- 
dent copies of r[„^] , . . . , r[„^] . 

(ii) Conditionally given the fine spinal mass partition of a self- similar frag- 
mentation CRT (T, yu) with parameters {a,u), the corresponding collection 
of subtrees T of T , with each T of mass m equipped with m~^fj. restricted 
to T, modulo isomorphism and multiplication of edge lengths by m^, is a 
collection of independent copies of (T, fi) . 

Proof. Part (i) follows easily from the defining Markov (fragmentation/ 
branching) property of T[„]. For part (ii), consider 11 a partition- valued 
(a, i^)-fragmentation constructed from (T, ^u). Let n(-j)(t) denote the block 
of n(t) containing i, i > 1, and recall that Di i denotes the first time at 
which 1 and i belong to distinct blocks. For all t > 0, the collection of blocks 
ij^ii) (Di^i + t),i>l) induces a partition of N. In the terminology of Bertoin 
([10], Definition 3.4), the sequence {Di^i,i > 1) is a stopping line, and as 
such, satisfies the extended branching property ([10], Lemma 3.14), which en- 
sures that given (n(j)(i5i^j),i > 1), the processes (n(j)(L'i^j + t),t > 0), i > 1, 
evolve, respectively, as {mill^'^\mft),t > 0), where mj is the asymptotic fre- 
quency of n(j)(L'i^j),i > 1, and the II^^^s are i.i.d. copies of n. Now, coming 
back to the CRT (T,/i), each component of its fine spinal partition corre- 
sponds to a fragmentation (n(j)(Z)i^j + t),t> 0) for some i and obviously, 
can be measurably reconstructed from this fragmentation (see the proof of 
Corollary 3). Conditionally given the masses mi,i > 1, the subtrees of the 
fine spinal partition are therefore independent, distributed (modulo isomor- 
phisms), respectively, as (m~"T, mj/x^™'* )), i>l, where rn~°'T means that 
the edge lengths of T have been multiplied by m^°' and //K ") is the image 
of ^ by this transformation. □ 
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Part (ii) of the proposition is a natural generalization of the spinal decom- 
position of the Brownian CRT described in [5]. When the Brownian CRT is 
encoded in a Brownian excursion, this corresponds to a path decomposition 
whereby a single excursion is decomposed into a countably infinite collection 
of independent copies of itself. 

In view of this symmetry property of the fine spinal partition, it is natural 
to look for some more explicit description of this decomposition, such as its 
EPPF or the distribution on of the corresponding mass partition. While 
such descriptions are known for the Brownian CRT, and more generally 
for all binary self-similar fragmentation CRTs according to the previous 
section, they appear to be difficult to obtain in general. But searching for 
conditions which simplify the structure of the fine spinal partition of (T, /x) 
leads naturally to consideration of further symmetry properties, and then to 
interesting examples with these properties for which explicit computations 
can be made. Consider first the fine partition of the set of leaves in some 
block of the coarse spinal partition of T^n+i] (restricted to {2, . . . ,n + 1}). 
By recursive arguments, it is enough to discuss the fine partition of the first 
block of the coarse spinal partition. 

For each s G iS-*- let Pg denote the probability measure governing an ex- 
changeable random partition 11 of N whose ranked frequencies are equal to 
s, and for a measure v on let 



be the corresponding distribution of 11 as a mixture of Kingman's paintbox 
partitions. For each n the distribution of n„, is determined by the formula 



for every partition {Bi, . . . , Bk} of [n] into A; > 1 parts, for some function 
Pjj{ni, . . . ,nk) of compositions {ni, . . . ,nk) of positive integers n. We re- 
fer here to [32] or [10] for a specific formula for p^{ni, . . . ,nk). In partic- 
ular, 1) = Xsi (1 — Ei>i ^D'^i.ds). Note that Pu{ni, . . . , n^) < oo for all 
ni,...,nk S N, k>2, if and only if pjy(l,l) < oo, that is, if and only if 
/5i(l <oo. 

Definition 1 . The function is called the exchangeable partition rate 
function [EPRF) associated with v. If is a probability measure, then 
so is Pi,, and p^ is known as an exchangeable partition probability function 
{EPPF). 

Note that we have the addition rule 

Py{ni,..., Uk) = Pu{ni -M, n2, . . . , nfc) H h Pi.(ni, . . . , nfc_i, n^. + 1) 




P,(n„ = {Bi,.. .,Bk]) =pA#Bi, . . . , 



+ p^{7ii,...,ni„l). 
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The following lemma presents a basic decomposition in some generality. 

Lemma 5. Let u be a dislocation measure on with associated EPRF 
Pu- Then for every k>2 and every composition ni, . . . , of n>2 into at 
least two parts, 

(9) P,y{ni, . . . ,nk) = g{n,ni)p^f^^^^^^{n2, . . . ,nk) 

for some function g{n,ni) and some family of probability measures V{n,ni) 
on indexed by 1 < ni < n — 1. 

Proof. Let n be a homogeneous fragmentation with dislocation mea- 
sure I'. The result is obtained by conditioning on the size of the block 
Bi containing 1. We (have to) take g{n,ni) as the total rate associated 
with the formation of a particular block Bi of ni out of n elements. Then 

{^^_}i)gin,ni) = ^{n — l:n — ni) as in (7), so that 

$(n-l)= 5: [I _M5(n,ni)=P,(n„/{N}) 

ni=l 

(10) 



as in (8), is the total rate of formation of partitions of [n] with at least 2 
parts. Then p^(^^ „j)('t-25 • • • is the conditional probability, given the par- 
ticular set B\ , that the remaining n — n\ elements are partitioned as they 
must be to make a particular partition of [n] into blocks of sizes ui, . . . ,nk- 
To be more precise, we can take 

V{n,m){ds) = -^ [ 5]rr^(l-r,)"5,^/(i_,^)(ds)z.(dr), 
g[n,ni) Jsi^ 

where fj is the vector r with component rj omitted. By Kingman's paint- 
box representation and conditioning on the color i of the first block, we 
then get for all partitions with block sizes (ni, . . . ,nfc) in order of least ele- 
ment 

^^(n,ni)K,---,nfc) 

Ps{n2, . . .,nk)V{n,ni){ds) 

Si- 

I , E^"H1 - r.)"-"^Pf,/(i_.,)(n2, . . ..n,)v{dv) 
p^{ni,...,nk), 



g{n 

1 



g{n,ni) 
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where by convention ps = ps^ ■ ^ 

This discussion simphfies greatly for measures i' with the special symme- 
try property introduced in the following definition: 

Definition 2. Let be a measure on S^, and let V he a probability 
measure on S^. Say that u has u as its factor, if P(n, ni) in (9) can be chosen 
identically equal to P for every 1 < ni < n, that is, 

(11) p^{ni,...,nk) = g{n,ni)p^{n2,. .. ,nk) 

for every composition ni,...,nfc of n > 2 into at least 2 parts, and some 
function g{n,ni). 

Note that v may be sigma-finite, but that V is always assumed to be a 
probability measure. It is obvious that if v has factor V, then V is unique. 
A rich class of measures v which admit a factor V is the class of Poisson- 
Dirichlet measures considered in the next section. It is an open problem 
[32], Problem 3.7, even for probability measures, to describe all measures 
v on which admit a factor V. Note that all binary dislocation measures 
trivially admit a factor, as well as ordered Dirichlet(a, . . . ,a) including the 
Dirac mass at (l/m, . . . , 1/m). The latter are just the remaining members 
of the Ewens-Pitman two-parameter family. 

Following the formalism of [31], Corollary 13, given two random elements 
V and V of S^, and a probability distribution V on S^, say that V' is a 
d -fragmentation of V if the joint distribution of V and V' is the same as 
if V is derived from V by shattering each fragment of V independently in 
proportions determined by V. 

Theorem 6. Let u be a dislocation measure on , let be some 

self-similar CRT derived from fragmentation according to u, and let V he a 
probability distribution on . Then the following two conditions are equiv- 
alent: 

(i) the measure v has v as a factor; 

(ii) the fine spinal mass partition of (T, fi) is a V -fragmentation of the 
coarse spinal mass partition of (T, ^u) . 

Proof. According to Pitman ([31], Lemma 35), the fine spinal partition 
is a fragmentation of the coarse spinal partition if and only if, for all n > 1 , 
in passing from the coarse spinal partition of [n] generated by T[„] to the fine 
one, within each block of the coarse partition the fine partition is distributed 
according to Pp, independently between blocks of the coarse partition. So 
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fix some integer n and let Bi, . . . ,Bk be the blocks of the coarse spinal par- 
tition of T[„_|_i] restricted to {2, . . . , n + 1}, with respective sizes ni, . . . , n^. 
Due to the fragmentation property of the trees T[„],n > 1, the correspond- 
ing fine spinal partition of T^^+i] is obtained by splitting independently Bi 

according to „+i„„^), B2 according to P-(n+l-ni,n+l-ni-n2)' • • • > ^fc 

according to i), where V^n + 1, n -|- 1 — ni),d{n + 1 — ni,n + 1 — ni — 

722), . . . ,i?(nfc -|- 1, 1) are probability measures satisfying (9). The fine spinal 
partition is therefore a P-fragmentation of the coarse spinal partition if and 
only if P(n,?ii) can be chosen equal to z? for all 1 < ni < n. □ 

3. Poisson Dirichlet fragmentations. We now turn to a particular fam- 
ily of fragmentation processes, namely the Poisson-Dirichlet fragmentations, 
characterized by dislocation measures of type PD*(a,0), 0<a<l,^>— 2a, 
as defined below by (19). This family generalizes the family of previously 
studied stable fragmentations [29, 30], constructed from the stable trees 
{Tp,np) with index /3, 1 < /3 < 2. These stable CRTs were introduced and 
studied by Duquesne and Le Gall [14, 15] to which we refer for a rigorous 
construction. Roughly, 7g arises as the limit in distribution as n — > 00 of 
rescaled critical Gallon- Watson trees T„, conditioned to have n vertices, 
with edge-lengths n^/^~^ , and an offspring distribution (r/^, > 0) such that 
r/fc ~ Ck~^~^ as k ^ 00. It is endowed with a (random) probability measure 
fj,p which is the limit as n — > cxo of the empirical measure on the vertices of 
Tn- Miermont [29] shows that the partition- valued process constructed by 
random sampling of leaves Li,L2,... from {Tp,ij,p) according to (as ex- 
plained at the end of Section 1.2) is a self-similar fragmentation with index 
1//3 — 1 , and dislocation measure 1^/5 defined for all nonnegative measurable 
function / on .S^^ by 



Tf 



Ai A2 

T ' T ' 



(and no erosion). Here T = J2i^i '^i where Ai > A2 > • • • are the points 
of a Poisson process on (0, 00) with intensity (/3r(l — l/(3))~^x~^^f^~^ dx. 
Besides, cutting the stable tree Tp at nodes (see [30]), Miermont obtained a 
self-similar fragmentation with index 1/(3 and the same dislocation measure 
up. 

3.1. Definition and factorization property. For < a < 1,^ > —a, let 
PD{a,6) denote the two-parameter Poisson-Dirichlet distribution on S^, 
defined as the distribution of the decreasing rearrangement of its size-biased 
presentation, which is 

(13) Wi, (1 - Wi)W2, (1 - Wi){l - W2)W3, . . . 
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for Wi, which are independent beta(l — a,ia + 6) variables. The formula for 
the corresponding EPPF is [32], Theorem 3.2, 

(14) PPD(a,e)(?^i, • • • ,rafc) = Tr—ff\ llil - 

-|- PJn-l -^-j^ 

for every composition (ni,...,nfc) of n, where = r(3; + n)/r(x) is a 
rising factorial. It is evident by inspection of this formula and (11) that 
the probability measure PD(a, 0) admits PD(a, Q + a) as a factor for every 
< a < 1 and 6 > —a. Following Miermont [29] we now consider the rescaled 
measure 

(15) PD*(a,e):=^^i±^PD(a,e), 

which is defined in the first instance for < a < 1 and —a <9. It is known 
([32], Corollary 3.9) that for < a < 1 there is the absolute continuity rela- 
tion 

(16) PB*{a,e){ds) = (Sa(s))''/"PD(a,0)(ds), 

where ^^(s) is the a-diversity which is almost surely associated to a sequence 
s= (si,S2)---) with distribution PD(q,0) by the formula 

(17) S„(s) :=r(l-a) lim js". 

The PT){a,9) distribution is recovered from (16) for —a < 9 hj normaliza- 
tion as in (15). The a-diversity Sa, which has a Mittag-Leffler distribution 
(see, e.g., [32], (0.43)), appears variously disguised in different contexts, for 
example, as a local time variable ([32], page 10), or again as Sa = T~" for 
a positive stable variable T of index a. Indeed, if such a T is constructed 
as T = J2i^i ^* where Ai > A2 > • • • are the points of a Poisson process on 
(0,00) with intensity a(r(l — a))~^ x~'^~^ dx , then 

(Ai/T,A2/r,...)=dPD(a,0) 

and, according to [32], (4.45), 

5«(Ai/T,A2/r,...)=r-° a.s., 

so that for every nonnegative measurable function / of s = (si, S2, • • •) £ S^, 

(18) / /(s)PD*(a,0)(ds)=E[r-V(Ai/r,A2/r,...)]. 

Lemma 7. For each < a < 1, let PD*{a,0) be the measure defined on 
for each real 9 by either (16) or (18). Then for —2a < 9, this measure 
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PD*{a,9) is also the unique measure with no mass at (1,0,0,...) whose 
EPRF is given for k>2 by 

(19) PpD*(a,e){n'i,...,nk) = Y(n + e) 11^^ 

^ ' i=l 

and for k = 1 by the same formula for —a < 9, and by oo for —2a <9 < —a. 
Basic integrability properties of this extended family of Poisson-Dirichlet 
measures are 

(20) / PD*(a,0)(ds) < oo ^ 9>-a; 

(21) / (1 -si)PD*(a,0)(ds) < oo ^ 9 > -2a. 

For each choice of (a, 9) with 9 > —2a the measure PD*{a,9) has the prob- 
ability distribution FD{a,9 + a) as its factor. 

Proof. Following Miermont ([29], Section 3.3) we observe from (14) 
and (15) that the formula (19) holds in the first instance for all 9 > —a, 
and that the right-hand side of (19) is analytic in 9 for Re(0) > —2a, when 
k>2. To get (19) for all 9 > —2a, note that the left-hand side of (19) can be 
written as E[T~^y] where Y is some positive r.v. depending on ni, . . . ,nk 
and then 

E[T-'Y]=E[T''Yl{T<i}]+nT-'Yl{T>i}], 

where the first term is finite for all ^ € M, hence an entire function of 9. So the 
second term for 9 > —a equals a function that is analytic for Re(0) > —2a. 
We claim that this equality of functions extends to 9 > —2a. Indeed, consider 
some nonnegative r.v. Z such that M{t) := E[e*'^] < oo for t < ri and M{t) = 
N(t) for t <ri where N is analytic for Re(i) < r2 with < ri < r2. Then the 
identity for t < ri gives the power series expansion N{t) = X]o° ^"^[^"]/^' 
for \t\ <ri. Since N is analytic for Re(t) < r2, this power series converges 
and this identity holds also for |t| < r2. Hence for < t < r2 we can compute 
by monotone convergence M{t) = t'^E[Z"]/n! = iV(t). Hence (19). 

The fact (20) comes from formula (0.40) and the following line in [32]. As 
for (21), we have seen in Section 2.2 that this integrability condition holds 
if and only if the expressions in (19) are finite for every choice of ni, . . . ,nfc 
with k>2, and this is clear by inspection of (19). □ 

The infinite measure PD*(a,— a) was already used and studied by Bas- 
devant [7] in the context of Ruelle's probability cascades. 
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Remarks. I. For < a < 1, 6* > -a, the EPPF (14) gives 

(22) PpD(a,.)(n„ / {N}) = 1 - 
and hence 

(23) PpD.(„,.)(n„ / {N}) = + (1 - [TT^ j 

in the first instance for O<a<l,0>— a, and then by analytic continuation 
for 0<a<l,^>— 2a, with values of the right-hand side defined by conti- 
nuity for 6 = —a or 6* = —1. To see that the left-hand side of (23) is analytic 
in this range, observe that for each n this function of (a, 9) is just a finite 
sum of the functions in (19) weighted by combinatorial coefficients. 

II. From the fact (13) that a size-biased pick from PD(q;,^) has beta(l — 
a,a + 6) distribution for < a < 1, > —a, we can write down 

(0 < s < 1) 

and hence for —2a<6 by analytic continuation 

PD*(a,.)(., . ..) ^ ,^,%\^l^l,f %^ - sr^~^ 

(24) ' 

(0<s< 1). 

The image of this measure by the change of variable x = — log s is the cor- 
responding Levy measure 

r(i — a)r(l + a + U) 

(25) 

(0 < X < oo). 



From Theorem 6 we now deduce: 



Corollary 8. For each < a < 1,6 > -2a, let {Tafi,^j) be some CRT 
derived from fragmentation process with dislocation measure PD*(a,0). The 
sequence of discrete fragmentation trees (T[„],n> 1) embedded in (7^_e,/x) is 
governed by fragmentations of [n] according the EPPF obtained by normal- 
ization of formula (19) by formula (23). The fine spinal mass partition of 
{Tafi,^) is a FD {a, a + 9) -fragmentation of the coarse spinal mass partition 



20 



B. HAAS, J. PITMAN AND M. WINKEL 



of (Ta^B,^), which is derived from the range of 1 — e ^ for the pure jump 
subordinator ^ with Levy measure (25) and Laplace exponent 

aT{2 + e/a) f{l + e)T{l-a) 



(26) $„,e(z) 



{a + e)T{l-a)\ T{2 + e) 

{z + l + e)T{z + l-a) 



T{z + 2 + e) 
a fT'{z + l-a) r'(l-a 



T{l-a)\T{z + l-a) T{l-a) 



-a. 



Last, for 6 € (—2a, — q) we have an interesting regime where Proposition 
4 appHes along with the asymptotic theory of consistent Markov branching 
models in [26]. Specifically, 

Corollary 9. For 9 G (—2a, —a), let (T[„],n >l) he a Markov branch- 
ing model derived from a self-similar fragmentation with dislocation measure 
PD*(a,^). Adding unit edge lengths to Tj^j, there is the convergence in prob- 
ability 

(07\ \a + e\T{l-a) 

^^^^ aT{2 + e/a) 

for the Gromov-Hausdorff topology, where the limit is a self-similar frag- 
mentation CRT of index 6 + a and dislocation measure PD*(a,0). 

Proof. Note from (24) that 

PD*(a, e){s, <!-.)- I ^.ir^P^.^rl ^m """"' ^ ^ 0- 

|a + t'li (1 — aji (1 + a + t'j 

Then Theorem 2 of [26] applies [A.^^ clearly also satisfies /°° J\-afi{ds) < oo 
for some p> 0], which gives (27). □ 

3.2. Stable fragmentations. The case 1/2 < a < 1 is of special interest. 
Then —2a < —1 < —a, so we can take 9 = —1 in (24), and then the Levy 
measure (25) is of the form 

(28) A(dx) = c;,(l-e-^)-''-^e-^^dx 

for some constant Cfe > and 6 = 1 — a. It is known [19] that if ^ is a sub- 
ordinator with this Levy measure, for any b S (0,1), then the closure of 
the range of e~^ is reversible and identical in law with the zero set of a 
Bessel bridge of dimension 2 — 2b. The corresponding distribution of ranked 
lengths of intervals is then known to be PD(6, 6) ([32], Corollary 4.9). Mier- 
mont ([29], page 444) found the same Levy measure, up to a scaling con- 
stant, for the subordinator associated with the self-similar fragmentation of 
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index a — 1 € (—1/2,0) that he derived from the stable CRT Tp of index 
13 = 1/a € (1,2). Here we have reversed this Hne of reasoning, and con- 
structed Tp directly from combinatorial considerations, without relying on 
the relation between the height process of Tp and the stable process of in- 
dex /?, which was the basis of the work of Duquesne and Le Gall [14, 15]. 
As byproducts of this argument, we have a number of refinements of ear- 
lier work on Tg, which we summarize in the following corollary of previous 
results. 

Corollary 10. For each a G (1/2, 1), corresponding to (3 = 1/a G (1,2) 
the dislocation measure PD*(a, —1) derived from the two-parameter Poisson- 
Dirichlet family as in (19) has PD(a, a — 1) as a factor. Let (T[„] , n = 1, 2 . . .) 
be a consistent family of combinatorial trees governed by fragmentation ac- 
cording to PD*(a,— 1). Then: 

1. The tree T[„] is identical in law to the combinatorial tree with n leaves 
derived by sampling according the mass measure in the stable tree of 
index (3, and may be constructed from the sequence of combinatorial 
trees (T[^],n > 1), as indicated in [26], Theorem 2, or Corollary 3. 

2. The distribution of the coarse spinal mass partition ofTp is PD(1 — a, 1 — 
a). 

3. The coarse spinal interval partition of [0, 1] derived from Tp is exchange- 
able, with the same distribution as the collection of excursion intervals 
of a Bessel bridge of dimension 2a. The (1 — a)-diversity of this interval 
partition — defined in a way similar to (17) — is a multiple of the height 
of a leaf picked at random from the mass measure of Tp . This height has 
the same tilted Mittag-Leffler distribution as a multiple of the local time 
at of the Bessel bridge of dimension 2a. 

4. The corresponding fine spinal mass partition of is a PD(a,a — 1)- 
fragmentation of the coarse spinal mass partition. 

5. The unconditional distribution of the fine spinal mass partition of Tp is 
PD(a, I- a). 

6. The conditional distribution of the coarse spinal mass partition of Tp 
given the fine one is provided by the operator o/PD(7,7) coagulation, as 
defined in [31], for 'j = {1 — a)/a. 

7. Conditionally given the fine spinal mass partition ofTp, the correspond- 
ing collection of subtrees obtained by removing the spine, modulo isomor- 
phism and resettling trees T of mass m is a collection of 
independent copies ofTp. 

Proof. All but items 3, 5 and 6 follow immediately from the previous 
development. Items 5 and 6 are read from items 2 and 4 by the more general 
coagulation/fragmentation duality relation for the PD family provided by 
[31], Theorem 12. 
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The first assertion of item 3 is obvious from previous discussion. To get 
the remaining assertions of item 3, denote by 11 the coarse spinal partition 
of N \ {1} derived from 7g and for each n by Kn the number of blocks of 
n„. On the one hand, Theorem 3.8 and Lemma 3.13 of [32] ensure that 
n°'~^Kn converges a.s. to the (1 — Q)-diversity, that moreover has a tilted 
Mittag-Leffler distribution, given precisely by formula (3.27) of [32]. On the 
other hand, Kn + 1 is the height of leaf 1 in T^n]- Hence by Corollary 9, 
n°'~^Kn converges in probability to a multiple of the height of a leaf taken 
at random from the mass measure of 7g. This gives the second assertion of 
item 3. For the last assertion of item 3, recall from Theorem 5.3 in [33], that 
the decreasing sequence of excursion lengths of a Bessel bridge of dimension 
2a is absolutely continuous with respect to the distribution of the decreasing 
sequence of excursion lengths until time 1 of a Bessel process of dimension 
2a. By [32], Section 4.4, the local time at level until time 1 of this Bessel 
process is proportional to the (1 — Q;)-diversity constructed from the sequence 
of lengths of its excursions. Hence a similar result holds for the Bessel bridge. 



For more information about the distribution of random partitions in the 
PD family, see [20] and [34]. In the limiting case when /? | 2, the above results 
reduce to the description of the interval partition derived from the spinal 
decomposition of the Brownian CRT, which is well known to be distributed 
like the partition generated by excursions of a Brownian bridge. See [5] for 
applications of this decomposition to the asymptotics of random mappings. 
The structure of the fine spinal partition of 7g for 1 < /3 < 2 has no analogue 
for P = 2, because in the Brownian tree all splits are binary. 

4. Invariance under uniform re-rooting. It is of particular interest to 
consider fragmentation trees with additional symmetry properties. A well- 
known property of the stable tree 7^ with index (3 € (1,2], established by 
Aldous [2] for the Brownian CRT with (3 = 2, and by Duquesne and Le Gall 
[15], Proposition 4.8, for /3 G (1,2), is invariance under uniform re-rooting. 
See also [13]. Let us first introduce the discrete analogue of this property. 

For a tree Tj^j with leaves labeled by [n] , let j'^^'^'^^^) denote the tree 
with leaves labeled by [n] obtained by re-rooting T[„] at 1 and relabeling the 
original root by 1. See, for instance. Figure 3. 

Definition 3. Let (T[„] , n > 1) be a consistent Markov branching model. 
We say that the Markov branching model is invariant under uniform re- 
rooting if for all n > 1 



□ 




law „(ROOT«^l) 
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Fig. 3. A fragmentation tree T[g] and its re-rooted counterpart T^j " . 

Note that due to the exchangeabihty of leaf labels, leaf 1 is indeed a 
uniformly picked leaf of the de-labeled combinatorial tree shape. Due to the 
exchangeability of leaf labels, invariance under uniform re-rooting is in fact 
a property of de-labeled combinatorial tree shapes. 



Definition 4. Let (T,/i) be a CRT rooted at p and conditionally on 
(T, /i), let {Li, L2, ■ ■ ■) be a sample of leaves i.i.d. with distribution Let 
then Tt'^il denote the tree T re-rooted at Li. We say that {T,p,) is invariant 
under uniform re-rooting if for all n > 1, the law of the reduced subtree 
TZ{T,Li, . . . ,Ln) of T spanned by the root p and Li,...,L„ is invariant 
under re-rooting at Li, that is, 

7^(T[^ll , p, L2, . . . , L„) 7^(r , 1^12,..., k) 

as an identity in law of combinatorial tree shapes with assignment of edge 
lengths. 

Clearly, the invariance under uniform re-rooting of (T, p) implies the in- 
variance under uniform re-rooting of the sequence (T[„],n > 1) of combina- 
torial trees associated with {T,p). We will see that the converse is false [see 
the arguments after (36)]. 

Remark. In [2, 13] a different formalism is used for the definition of 
invariance under uniform re-rooting, via height functions of ordered CRTs. 
Briefly, assuming that the CRT (T,p) can be encoded into a continuous 
real-valued function H on [0, 1], with H{0) = H{1) = 0, such that: 

• T is isometric to the quotient space ([0, 1],^//)/ where 

dH{x,y):=H{x) + H{y)-2 min H{z) 

and 



^ dH{x,y)=0 
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with the convention [x,y] = [y,x], when y <x, 
• H is the measure induced by the projection of the Lebesgue measure on 
this quotient space, 

then the invariance under uniform re-rooting is defined via 

'I? H where 

U if uniformly distributed on [0, 1] independently of H and 

(29) H^''^{x):=H{u)+H{u + x)-2 min H{z), u,xe[0,l] 

zG[u,x+u] 

with the convention u + x = u + x— 1, when n + x > 1. It was proved in [25] 
that the structures of the combinatorial subtrees TZ{T,Li, . . . ,Ln), n>l, 
derived from some self-similar fragmentation CRT (T,iJ,) can be enriched 
with a consistent "uniform" order so as to encode the fragmentation CRTs 
into a continuous height function as described above, provided the disloca- 
tion measure is infinite. In that context, it is not hard to check that the 
height function definition and Definition 4 above are equivalent. Details are 
left to the reader. 

The goal of this section is twofold: first to give a combinatorial proof, 
different from that given in [2, 13, 15], of the fact that the stable trees are 
invariant under uniform re-rooting; second to prove that among the self- 
similar fragmentation CRTs, the stable trees are the only ones, up to a 
scaling factor, to satisfy this invariance property. 

For the Brownian CRT {T2, /X2), we recall that the partition- valued process 
constructed by random sampling of leaves Li,L2,... according to /i2 is a 
self-similar fragmentation with index a = —1/2 and dislocation measure 1^2 
defined by 1^2 (•5i + S2 7^ 1) = and 

1^2(81 G dx) = (2/7r) 3.-3/2 _ a;)-3/2^^^ 1/2 < x < 1 

(see [9]). The dislocation measure up associated to the stable tree 7g, 1 < 
/3 < 2, is given by (12) and its self-similar index is 1//3 — 1. 

Theorem 11. (i) [2, 15]. For all j3 ^ (1,2], the stable tree (7g,/i^) is 
invariant under uniform re-rooting. 

(ii) Let (T, //) he a self-similar fragmentation CRT with parameters (a, v) 
and suppose it is invariant under uniform re-rooting. Then there exists some 
[3 (1,2] and some constant C > such that u = Cup and a = 1//3 — 1. 

Remark. According to [13], a stronger invariance result is available for 
the height functions H of stable trees (and more generally Levy trees) , which 
is that ffl^l, as defined in (29), is distributed as H for each fixed u € [0, 1]. 
See also [28] for the Brownian CRT. 

The rest of this section is devoted to the proof of Theorem 11. 
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4.1. Spinal decomposition and proof of Theorem The first step is 

to consider the spinal decomposition of trees invariant under uniform re- 
rooting: one consequence of this invariance is that the coarse spinal interval 
partition of [0, 1] derived from the tree is reversible (in fact an exchangeable 
interval partition of [0, 1]; see [19]). The class of trees with this property is 
significantly restricted by the following proposition. 

Proposition 12. Let (T[„],n > 1) be a sequence of combinatorial trees 
associated with some self-similar fragmentation CRT (T, /x) with dislocation 
measure v, and let ^ be the subordinator describing the evolution of the mass 
fragment containing 1 in an associated homogeneous fragmentation process 
(cf. Section 2.1). 

(i) The coarse spinal composition Cn ofn derived from T\^n+i\ [^s defined 
in (3)] is reversible for each n if and only if E, has a Levy measure of the 
form 

(30) k{dx) = c(l - e-^)-^-ie-''^' dx 

for some < 6 < 1 and some constant c > 0. 

(ii) There cannot exist a self-similar fragmentation CRT with a Levy 
measure of this form when b> 1/2 . 

Proof. Part (i) is read from [19], Theorem 10.1, just using that (C„, n > 
1) is a regenerative composition structure. For part (ii), from (5) 

A{dx) =e~^^u{-logSiedx), x > 0, 

i>l 

and (30) we deduce by the transformation z = e~^ , x = — log(z), dx = —dz/z 
''{si e dz) = c(l - z)"^~^z''"2 dz, z e (0, 1). 

i>l 

Since u is supported by decreasing sequences with J2'^i Si = 1, Si <l/i ior 
all i > 1. In particular, 

(31) u{siedz) = c{l- zy''~^z''-^dz, (1/2,1). 
Using the fact that for z G (0, 1/2) 

z-\l-z)'~^>{l-z)-''~'z'~' 




b>l/2, 
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we see that for 6 > 1/2 

/ {1- z)i^{siedz)>c f {I- zY^z^-'^dz 

"'{0,1) "'(1/2,1) 



= c z-\i-zy-'dz 

J(0,l/2) 

>c( {I - z)-^-^ z^-^ dz 

"'(0,1/2) 



> / zv{si € dz) 
^^^0,1) 

by (31). On the other hand, we have 

(1 — z)v[si G dz) 



(0,1) 



/ zv > Si^dz] 

Ao,i) \h ) 

^Siu{ds) 

i>2 

V / Siu{ds) 

V / zi^{si G dz), 



__ '(0,1) 

which contradicts the inequahty obtained in the preceding calculation. □ 

The Levy measure associated with some fragmentation tree invariant un- 
der uniform re-rooting is therefore of the form (30) for some <b< 1/2. We 
recall that the Levy measures associated to /^-stable trees are of this form 
for 6 = 1 — 1//3 (see Section 3.2 for 1 < /3 < 2 and [9] for /3 = 2) which covers 
the range (0,1/2] when (3 varies in (1,2]. 

Proof of Theorem ll(i). Let (Tp,fip) be some stable CRT with index 
P € (1,2]. According to the previous proposition, its coarse spinal interval 
partition of [0,1] is reversible. Items 4 and 7 of Corollary 10 then give a 
construction of this CRT from its coarse spinal interval partition, via its fine 
spinal mass partition, that ensures the invariance under re-rooting property. 
□ 

4.2. Characterization of the dislocation measure and proof of Theorem 
ii(ii). In general the Levy measure does not characterize the dislocation 
measure of the fragmentation tree, that is, two different dislocation mea- 
sures may lead to the same Levy measure A; see Haas [23] for an example. 
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However, this complication no longer arises when the set of fragmentation 
trees is restricted to the ones invariant under uniform re-rooting. 

Proposition 13. Let (T,^) be a self-similar fragmentation CRT with 
parameters (a, zv) and suppose it is invariant under uniform re-rooting. Then 
the dislocation measure v can he re- constructed from the Levy measure A 
associated to the tagged fragment. 

Together with Proposition 12, this implies that: 

Corollary 14. The dislocation measure of a self- similar fragmentation 
CRT invariant under uniform re-rooting is proportional to vp for some (3 G 
(1,2]. 

In order to prove Proposition 13, we first set up two lemmas. In the rest of 
this section, the CRT (T, /x) with parameters (a, u) is fixed and supposed to 
be invariant under uniform re-rooting. A sample of leaves L.j,i > 1, is given 
and we consider the associated partition- valued fragmentation 11. We call 
Pn the probabilities 

p„(ni,...,nfc)=P(n„(t„) = {{l, 

_ pujni, ...,nk) 
$(n-l) ' 

where t^ is the first time when n„ differs from [n] and (ni, . . . ,nfc) denotes 
any composition of n with A; > 2 (in other words, the probabilities p„ are the 
EPPFs obtained by conditioning on {n„ 7^ in the proof of Lemma 

5). Note in particular that 

" n' 1 

where the sum is over all compositions of n; see [32], Exercise 2.1.3. 

Lemma 15. For all compositions (ni, . . . ^n^) of n with k>2 
Pn{ni,...,nk)pni{l,ni -1) 

(33) 

= Pn{n2 H hnfc + l,ni - l)p„_„i+i(l, n2, . . .,nk) 

with the convention, when ni = l, that the probabilities involving expressions 
with a term ni — 1 = are all equal to 1. 



. . . ,ni},{ni + 1, . . . ,ni + n2}, . . . , 
{niH hnfc_i + l,...,n}}) 
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Fig. 4. This configuration always happen with positive probability. 

Proof. Consider the following fragmentation scheme: the first time 
at which the block {l,...,n} splits, it splits in blocks {!,..., ni}, {ni + 
1, . . . , ni + n2}, . . . , {rii + • • • + Uk-i + 1, . . . , n}; then the first of these blocks 
splits in {!}, {2, . . . ,ni}. We are not really interested in the further evo- 
lution of {2, ... , ni}, {ni + 1, . . . , ni + ^2}, . . . , {rii H h nfc„i + 1, . . . , n}; 

let us just say that it is in a configuration which happens with a (strictly) 
positive probability, say r„(ni, . . . ,nk) (e.g., evolutions as in Figure 4). Con- 
sider then the discrete tree with leaf labels obtained from this fragmentation 
scheme. The probability that the tree with n leaves TZ{T, Li, L2, ■ ■ ■ , L^) has 
this labeled shape is exactly 

(34) Pn{ni, nk)pni (1, m - l)r„(ni, . . . , n^). 

Now, look at the same tree rooted at Li, that is, 1Z{T^^ , L2, . . . , Ln); cf. 
Figure 5. Starting from the root Li, the corresponding fragmentation scheme 
evolves as follows: {/>, 2, . . . , n} first splits in {2, . . . , ni}, {p, ni + 1, . . . , n\. 
Then {p, ni + 1, . . . , n} splits in {p}, {ni + 1, . . . , ni + ^2}, . . . , {ni + • • • + 
rik-i + 1, . . . , n}. And the blocks {2, . . . , ni}, {ni + 1, . . . , ni + 712}, . . . , {ni + 
■ ■ ■ + nk-i + 1, . . . , n} then all split according to the same configuration as 
in the previous scheme. By invariance under uniform re-rooting, the subtree 
7^(T[^i1, /9, L2, . . . , L„,) is distributed as TZ{T,Li,L2,. ■ ■ ,Ln), and therefore, 
the probability that TZ{T^^^^ , p, L2, . . . , L„) has this labeled shape is 

(35) p„(ni - 1, 712 H h nfc + l)p„_„j+i(l, n2, . . . , nfc)r„(ni, . . . , n^). 

By invariance under re-rooting, the probabilities in (34) and (35) are equal. 
This yields (33), since r„(ni, . . . ,ni^) ^0. □ 
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Remark. It is easy to check that the probabiHties pn associated to the 
stable trees, which are obtained by normahzation of formula (19) by (23) 
with 6 = —1, satisfy relations (33). 

Lemma 16. The probabilities P3{2, 1), ^3(1, 1, 1) and Pn{^, n — 1), Vn > 
2, are determined by the Levy measure A. 

Proof. Consider n*^, the homogeneous fragmentation constructed from 
n by time-changes. The probabilities pn describe the ordered sizes of blocks 
of at the first time when it differs from [n]. Let D^^, "2, <i <n, be the 
first time in this homogeneous fragmentation at which 1 and i belong to 
separate fragments. Let {X^(t),t > 0) be the decreasing process of masses of 
fragments containing 1. The law of A'' = exp(— ^) is determined by A, as well 
as that of (A'^, -D? 2^ • • • 1 n) since 

P(Z??,2 > S2, . . . , > Sn I A°) = A°(S2) ' " " A°(s„), 

for all sequences of times (s2, . . . , Sn). In particular, knowing A, we know the 
probabilities ¥{Di2 < iiiin3<j<„ Z)]* J =p„(l,n — 1). In the particular case 
when n = 3, this gives ^3(1, 2) and then ^3(1, 1, 1), since 3p3(l, 2) + ^3(1, 1, 
1) = 1. □ 

Remark. It is not hard to see, with a specific example, that in general 
A does not characterize the probabilities Pi{ni, . . . , n^), ni + • • • + n/c = 4. 

Proof of Proposition 13. The dislocation measure is determined, up 
to a scaling constant, by the probabilities Pnin-i, . . . , n^), Vn > 2, V(ni, . . . , rif.) 
composition of n with k>2. The scaling constant is then obtained from A, 



nk 



nl-i 



ROOT ROOT 

Fig. 5. By the invariance under re-rooting assumption, these two configurations are 
equally likely to occur. 
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using (5). The goal here is therefore to check that under the re-rooting as- 
sumption, all the probabilities pn can be recovered from A. Suppose A is 
known. We proceed by induction on n. For n = 2, P2(l) 1) = 1- For n = 3, 
the probabilities are known, by Lemma 16. Suppose now that the p^n^s 
are known, Vm <n—l. By Lemma 16, — 1) is also known. Then, by 

Lemma 15, V(n2, . . . , n^) composition of n — 2, 

p„(2,n2, . . . ,nfc)p2(l, 1) =Pn{l,n- l)p„_i(l,n2, . . • ,nfc), 

which gives p„(2,?i2, . . . The probabilities Pnini, ■ ■ ■ ,ni:), with rii > 3, 
are obtained in the same manner, by induction on ni, thanks to Lemma 15 
[note that Pm (1, ni — 1) 7^ 0, Vni]. Therefore, for all compositions . . . , n^) / 
(1, . . . , 1), A; > 2, of n, we have Pnin-i, . . . , n^), since there is at least one rii^ 1 
and, by symmetry, one can suppose it is ni. It remains to get ^^,(1, . . . , 1), 
which can be done by using equality (32). □ 

Proof of Theorem ll(ii). By Corollary 14, since the law of the CRT 
(T,/i) is invariant under uniform re-rooting, there exists some (3 G (1,2] and 
some constant C such that v = Cup. It remains to prove that the index of 
self-similarity is a = 1/ (3 — 1. Up to now, we only used the combinatorial 
properties of reduced trees encoded in the dislocation measure and not 
the further structure of the CRT (T, ^l) that involves the edge lengths and 
depends on the scaling parameter a. To conclude that a = 1//3 — 1, we must 
consider edge lengths. 

Given the CRT (T,/i) rooted at p and the leaves Li,L2, the reduced tree 
TZ{T ,Li,L2) can be described by the edge-lengths Di2,Di — Di2,D2 — 
Di^2, where Di^2 is the separation time of 1 and 2 in 11 and Di the first 
time at which the block containing i is reduced to a singleton, i = 1,2. By 
invariance under re-rooting, -Di,2 must have the same law as Di — Di^2- We 
already know that this is true for the index 1//3 — 1, from Duquesne-Le 
Gall's Theorem 4.8 in [15]. 

Using time-changes relating 11 and its homogeneous counterpart II'' (these 
time-changes are given specifically in [9]), we have 
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where is the first separation time of 1 and 2 in 11'^. By the strong 
Markov property of 11 (see [9]), \Il^^-^{t + Di2)\ has same distribution as 
|n^j^^(D]' 2)| • |n^j^^(t)|, where II'^ is an independent copy of n*^. Therefore, 

where Di has same distribution as Di and is independent of |n^j^^(Z)]' 2)l~"- 
Assuming that Di^2 has same distribution as Di — D12 and taking expec- 
tations in (36), we obtain 

E[|nJi)(i??,2)niE[i?i] =E[Z?i - Z?i,2] 

=nD,,2]=Em{i-n^\,^{Di2)r])- 

For a < we may cancel the common factor of IE[Di] < 00 [Di is an expo- 
nential functional of a subordinator). It remains to notice that the function 
/(a) = E[|n^j^^(Z)'?^2)l~"] is a strictly monotone function with limit at —00 
and 1 at 0, so that the equation /(a) = 1 — /(a) has a unique solution a, 
which has to be the index o = l//? — 1. □ 

Acknowledgment. We thank Gregory Miermont for many interesting dis- 
cussions. 
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